The Effect of Sound Masking on Speech Recognition
Abstract
This study reports two experiments that investigated the effects of simple white noise, and of mixtures of white noise with other sounds, on the perception of speech. In both experiments, university students listened to short sentences under various sound masking conditions. In Experiment 1, standard sets of speakers were used for both the speech and the masking stimuli. Compared with a baseline with no masking sound, participants had significantly greater difficulty understanding the sentences: the average level of understanding was 28% in the white noise condition and 20% in the mixed noise condition, in which white noise was mixed with pink noise and sounds of running water. In Experiment 2, a test model of a specially designed sound masking speaker was used to present the masking noise. In addition, sounds of tweeting birds and healing music were added to the mixed noise from Experiment 1 to create three masking noise conditions. The average level of understanding was 14% for the mixed noise condition, and 24% and 30% for the bird and music conditions respectively. The higher understanding rates in the latter two conditions were due to the lower volume of the mixed white noise, which was reduced to keep the overall volume, including the birds or music, at 55 dB. There were also significant effects of sentence type and reading voice gender, suggesting that auditory legibility depends not only on the speech-to-noise sound level ratio but also on other variables, such as the predictability of the sentences and the clarity of the speech. Feedback at the end of the sessions revealed that participants found mixed noise less irritating than pure white noise, and that they liked mixed noise with bird tweeting or music even better. Thus, it was concluded that mixed noise with occasional sounds of tweeting birds was the most suitable masking sound for commercial use, being efficient and not unpleasant.
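The point about the 55 dB ceiling can be illustrated with a short calculation. The sketch below assumes the masking components are incoherent, so their powers add, and it uses a hypothetical 50 dB level for the added bird sounds. The abstract reports only the 55 dB overall level, not the level of any component, so the resulting figure is illustrative and is not a value from the study.

```python
import math


def combine_levels_db(levels_db):
    """Total level of incoherent sources: sum their powers, convert back to dB."""
    return 10.0 * math.log10(sum(10.0 ** (level / 10.0) for level in levels_db))


def remaining_level_db(total_db, other_db):
    """Level one component may have so that, together with `other_db`,
    the combined level equals `total_db`."""
    if other_db >= total_db:
        raise ValueError("other component already meets or exceeds the target")
    return 10.0 * math.log10(
        10.0 ** (total_db / 10.0) - 10.0 ** (other_db / 10.0)
    )


TARGET_TOTAL_DB = 55.0      # overall masking level stated in the abstract
ASSUMED_BIRDSONG_DB = 50.0  # hypothetical value, NOT reported in the abstract

noise_db = remaining_level_db(TARGET_TOTAL_DB, ASSUMED_BIRDSONG_DB)
print(f"Mixed noise must drop to about {noise_db:.1f} dB")
```

Under these assumptions the mixed noise has to fall to roughly 53.3 dB, which gives a sense of why adding a second sound under a fixed overall level leaves less masking energy against the speech.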
Similar resources
Classification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
Optimization of Wavelet Packets to Minimize the Effect of Spectral Masking for Improving Speech Perception
Spectral masking occurs when the perception of one sound is affected by the presence of another sound like noise or unwanted sound of the same duration as the original sound. Earlier studies have shown that binaural dichotic presentation, using a pair of linear phase FIR comb filters with complementary magnitude responses, helps in reducing the effect of spectral masking. In the present study a...
Perspectives on wanted and unwanted sounds in outdoor environments: Studies of masking, stress recovery, and speech intelligibility
Jesper Alvarsson. An acoustic environment contains sounds from various sound sources, some generally perceived as wanted, others as unwanted. This thesis examines the effects of wanted and unwanted sounds in acoust...
Persian Cued Speech: The Effect on the Perception of Persian Language Phonemes and Monosyllabic Words with and without Sound in Hearing Impaired Children
Objectives: This paper studies the effect of Persian Cued Speech on the perception of Persian language phonemes and monosyllabic words with and without sound in hearing impaired children. Cued Speech is a sound based mode of communication for hearing impaired people that is comprised of a limited series of hand complements and the normal pattern of speech. And it is shown that it effectively ca...
Effect of masker type and age on speech intelligibility and spatial release from masking in children and adults.
Speech recognition in noisy environments improves when the speech signal is spatially separated from the interfering sound. This effect, known as spatial release from masking (SRM), was recently shown in young children. The present study compared SRM in children of ages 5-7 with adults for interferers introducing energetic, informational, and/or linguistic components. Three types of interferers...